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Abstract 

Consider n individuals who, by popular vote, choose among q > 2 alternatives, one of which 
is "better" than the others. Assume that each individual votes independently at random, and 
that the probability of voting for the better alternative is larger than the probability of voting 
for any other. It follows from the law of large numbers that a plurality vote among the n 
individuals would result in the correct outcome, with probability approaching one exponentially 
quickly as n — > oo. 

Our interest in this paper is in a variant of the process above where, after forming their initial 
opinions, the voters update their decisions based on some interaction with their neighbors in a 
social network. Our main example is "majority dynamics" , in which each voter adopts the most 
popular opinion among its friends. The interaction repeats for some number of rounds and is 
then followed by a population-wide plurality vote. 

The question we tackle is that of "efficient aggregation of information" : in which cases is the 
better alternative chosen with probability approaching one as n — > oo? Conversely, for which 
sequences of growing graphs does aggregation fail, so that the wrong alternative gets chosen 
with probability bounded away from zero? 

We construct a family of examples in which interaction prevents efficient aggregation of 
information, and give a condition on the social network which ensures that aggregation occurs. 

For the case of majority dynamics we also investigate the question of unanimity in the 
limit. In particular, if the voters' social network is an expander graph, we show that if the 
initial population is sufficiently biased towards a particular alternative then that alternative 
will eventually become the unanimous preference of the entire population. 

1 Introduction 



The mathematical study of voting systems began as early as 1785, when the Marquis de Condorcet[5] 
observed what is essentially a special case of the weak law of large numbers: suppose there is a 
large population of voters, and each one independently votes "correctly" with probability p > 1/2. 
Then as the population size grows, the probability that the outcome of a majority vote is "correct" 
converges to one. Thus, information is "efficiently aggregated". 
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In this work, we study a simple model of voter interaction, in which voters choose an independent 
random opinion initially, but then modify that opinion iteratively, based on what their friends 
think. Thus correlation between votes is introduced "naturally", through interaction. Our main 
example of interaction is majority dynamics, where at each round each voter adopts the opinion of 
the majority of its neighbors. The basic question that we address is that of efficient information 
aggregation: for which modes of interaction is information aggregated efficiently, and for which is 
it not? 

Additionally, we study some conditions for the achievement of unanimity, when the graph of 
social ties is an expander, and when agents use majority dynamics. 

1.1 Model 

We consider an election in which a finite set V of voters must choose between q > 2 alternatives, 
which we will take to be the elements of [q] = {0, 1, . . . ,q — 1}. The voters are connected by an 
undirected social network graph G = (V, E). Denote the neighbors of v € V by N v . 

Each voter v £ V will be initialized with a preference X v (0) 6 [q], picked independently from a 
distribution P over [q]. 

At time t £ {1, . . . , T}, v will update her opinion to X v (t) based on what her friends' opinions 
at times t—1 and earlier. At time T, an election will take place and a winner Y will be declared. 
Note that Y is a deterministic function of the initial votes (X v (0)) v& v 

A simple and important example is majority dynamics where q = 2: At each iteration of the 
dynamics, each individual v sets her vote to equal the most popular vote among her neighbors in 
the previous iteration (we elaborate below on the handling of ties). 

X v {t) = argmax ag | 01 | \{w\X w (t - 1) = a, w G JVt,}|. 

At some large time T an election by plurality takes place, so that the winner is 

Y = argmax a6{01} \{v\X v (T) = a}\. 

Note that the majority rule (or more generally the plurality rule, in the case of more than two 
alternatives) is fair and monotone: It is fair in the sense that is does not, as an election system, 
treat one alternative differently than another; it is invariant to a renaming of the alternatives. It 
is monotone in the sense that having extra supporters cannot hurt an alternative's case. 

As a generalization of majority dynamics, we allow any updating of opinions and any election 
system, provided that they are fair and monotone. For example, an individual may give more 
weight to some of her friends than to others, and the final election could an Electoral College 
system. In Sec. [5] we further relax the fairness condition. 

1.2 Overview of the results 
1.2.1 Social types 

Our study of information aggregation will utilize the idea of social type: we divide the voters V 
into a partition A of social types, and ask that any two voters of the same social type play the 
same role in the election process. More precisely we require that if the labels are removed from all 
individuals then it is impossible to tell apart two individuals of the same social type. This shall be 
rigorously defined in Section [2j 

In the case of majority dynamics, social types are induced by the automorphisms of the graph 
G: u,w 6 V are of the same social type if there exists an automorphism r of G such that t{u) = w. 



2 



Intuitively, this means that in an unlabeled drawing of G it is impossible to say which is u and 
which is w; u and w are of the same social type if they play the same role in the geometry of G, 
and hence play the same role in the election process. 

1.2.2 Aggregation of information 

Without loss of generality, we will assume that alternative is the best alternative, and that 
the initial opinion of each voter is slightly biased towards alternative 0: we take X v (0) to be a 
multinomial random variable such that P(X„(0) = 0) > P(X„(0) = j) for any j ^ 0. 

Although this bias could be very small, the law of large numbers guarantees that with enough 
voters, the outcome of a plurality vote at time would choose the correct alternative, except with 
exponentially low probability. We refer to this property as efficient aggregation of information. 

In Section [3l we study if information is still efficiently aggregated if we hold the vote at time 
T instead, after allowing the agents to interact. One of our main results (stated formally in 
Theorem 13.11 below) is that information is efficiently aggregated when each social type has many 
members. In particular, we show that the probability of choosing the correct alternative approaches 
one as the size of the smallest social type approaches infinity, with a polynomial dependence. 

This implies that in majority dynamics on a transitive graph, in which case all voters are of the 
same social type, the outcome of the final vote will be zero, except with probability that decreases 
polynomially with the number of voters. 

1.2.3 Lack of Aggregation 

Perhaps surprisingly, the condition requiring increasing size of each social type is necessary. Indeed 
in Section 13.11 we provide an example with q = 2 alternatives, majority dynamics and a final 
majority vote, which results in the wrong outcome, with constant probability regardless of the size 
of the population! 

1.2.4 Wider agreement, unanimity and expanders 

In Section \E{ we ask when, following T periods of interaction, a large part of the population is in 
agreement. 

Focusing on the case q = 2 and majority dynamics, we show that the proportion of the popula- 
tion that votes for alternative at time T is at least as large as the initial bias towards alternative 
0. 

We push the agreement threshold to its extreme in Section [61 where we show that if the social 
network is an expander graph, the mode of interaction is based on plurality, and there is enough 
initial bias then eventually the entire population will agree on alternative 0. 

1.3 Related work 

Our work is closely related to work of Kalai [15J who studies social choice using tools of discrete 
Fourier analysis. Kalai proves that any binary unbiased and monotone election system aggregates 
information efficiently, given that all the voters have low influence on the outcome. 

Our work expands on this work in several directions: First, we elucidate the role of voters types 
in this setup by showing that having large number of voters of each type implies aggregation and 
that without this condition aggregation may not occur. Second, we go beyond the binary world 
and explore general outcome spaces. Finally the questions of higher thresholds and unanimity were 
not considered before. 
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Kanoria and Montanary |17j study majority dynamics with two alternatives on regular (infinite) 
tree graphs, giving conditions which lead to convergence to unanimity. Their work can also be 
interpreted as a study of a zero temperature spin glasses, a model also studied by Howard [12] on 
3-regular trees and Fontes, Schonmann and Sidoravicius [7] onZ d . 

Berger [3] gives an example of a series of graphs in which majority dynamics results in the 
adoption, by all individuals, of the opinion of the individuals in a constant size group, provided 
they all agree. Thus these graphs could serve in place of our example (Section I3.ip . showing how 
aggregation fails when there is a small social type. We provide our example for completeness, and 
because it is somewhat simpler. 

Our work is related to the widely studied family of Gossip-based protocols on networks (see, 
e.g., Bawa et al. [3], Kempe et al. [18], and a survey by Shah [23]). The goal there is to design 
and/or analyze distributed, repeated algorithms for the aggregation of information on networks. 
For example, in the classical DeGroot model [6] agents "vote" with a real number, which they 
calculate at each iteration by averaging the votes of their neighbors from the previous iteration. 
The agents all converge to the same number, which is a good approximation of the average of the 
initial votes only if degrees are low [10] . or if, indeed, the size of the smallest social type is large. 
This model is fairly easy to analyze, since the votes in each iteration are a linear function of the 
votes in the previous iteration. Majority dynamics is a natural discretization of this process, but 
has proven to be more resistant to analysis. Indeed the non-linearity of the dynamics results not 
only in major technical challenges but also in different behaviors of the two models. 

Another related strain of models is that of Bayesian learning. Here the agents optimize their 
votes to those which are the most likely to be correct, given a prior over correct alternatives, an 
initial private signal and the votes of their neighbors in previous rounds (see, e.g., |21j). Perhaps 
surprisingly, this dynamic is not necessarily monotone and therefore its analysis requires different 
tools. The agents calculation there are more complicated, and hence more difficult to analyze. 
On the other hand, the optimality of the agents' actions makes the model amenable to martingale 
arguments, which don't apply in the case of majority dynamics. 

Our main proof uses tools from the field of Fourier analysis of Boolean functions on the discrete 
hypercube. In particular we use and extend results of Kahn, Kalai and Linial [13], Friedgut and 
Kalai [S], a strong version of the KKL theorem by Talagrand [21] and a recent generalization by 
Kalai and Mossel[16j. 

1.4 Acknowledgments 

We would like to thank Miklos Racz for his careful reading of the manuscript and his suggestions. 

2 Definitions and results 
2.1 Majority Dynamics 

Let V be a finite set of individuals. Let G = (V,E), an undirected finite graph, represent the 
network of social connections of V. We denote the neighbors of v G V by N v . We allow G to 
contain self-loops, so that v may or may not belong to N v . 

Let X v (t) G {0, 1} denote v'b vote at time t G {0, . . . , T}. Let each X v (0) be chosen from some 
distribution P over {0, 1}, independently and identically for all v G V. Note that once the initial 
votes (X v (0)) «gv are chosen, the process is deterministic. 

At times t > 0, v updates its vote to equal the majority opinion of its neighbors in the previous 
round. If the number of neighbors is even then we either add or remove v itself to the set of 
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neighbors N v , to avoid ties. 

X v (t) = argmax oe { 0)1 } \{w\X w (t - 1) = a, we N v }\. 

After some number of rounds T an election by majority takes place. We denote the winner by Yt'- 

Y T = argmax ag{01 | \{v\X v (T) = a}\. 

To avoid ties in the final election, we assume |V| is odd. 

We next define social types. Recall that r : V — > V is a graph automorphism of G = (V, E) if 
(u, v) G E o (t(u),t(v)) G E. We say that u and v are of the same social type if there exists a 
graph automorphism that maps u to v. Informally, this means that u and v play the same role in 
the geometry of the graph; it is impossible to tell which is which if the labels are removed from 
the vertices. It is easy to see that "being of the same social type" is an equivalence relation. We 
denote by A(G) the partition of the vertices of G into social types. We denote by m(G) the size of 
the smallest social type: 

m(G) = min \A\. 
AeA{G) 

Our main result in this section is that information is aggregated efficiently, provided that each 
social type has many members. To state our result, we first define the efficiency of an aggregation 
procedure. Let be the probability distribution {0, 1} such that P<5(0) = ^(1 + 5) and Ps(l) = 
^(1 — 5). Then the efficiency fj,g(G,T) of majority dynamics on G up to time T is 

to(G,T)=P 5 [Y T = 0]. 

Note that in a slight abuse of notation we use P^ to denote both the distribution over {0, 1} from 
which X v (0) is chosen, and the measure on (X v (t)) V £V,i<t<T and Yt which is induced by P5. 
Our main result for this section is the following: 

Theorem. There exists a universal constant C > such that for any graph G 

in r\ ^ 1 n ( n ^ogm(GY 
fj,s{G, T) > 1 — C exp I — C 



log(l/5) 

In particular, ^s{G) approaches one as m(G) tends to infinity. Note that the bound does not 
depend on T. This theorem is a special case of Theorem 13.11 which is stated below. 

In the other direction, we provide an example showing what can go wrong when m(G n ) does 
not grow to infinity. 

Theorem 2.1. For any 5 > 0, there exists a sequence of graphs G n , whose sizes converge to 
infinity, such that 

supsup// 5 (G n ,T) < 1. 

n T>1 

That is, there is some e > such that for any n and T the probability of choosing the wrong 
alternative is at least e. 
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2.2 Monotone Dynamics 

In this section we extend the definitions and results of the previous section to a large class of update 
rules and election systems, and a choice between more than two alternatives. 

Let [q] = {0, 1, . . . , q — 1} be the set of alternatives. The initial votes X v (0) are, as above, chosen 
i.i.d. from some P, which is now a distribution over [q]. As before, the process is deterministic once 
the initial votes are chosen. 

Let the history of u's neighborhood before time t be denoted by H v (t) = (X w (s)) s< t tW eN v - Then 
[g][*l xAr « ig the set of possible histories of the neighborhood of v before time t. 

For each a G [q] and k G N we define a relation > a on [q] k as follows. Let x, x' G [q] h . We write 
x' > a x if, for all i G [k] it holds that 

x ^ -f— x^, y X — CI* 

Alternatively, if a vector of votes x is changed to x' such that x' > a x, then for each i either X{ is 
unchanged, or it is changed to a. Note that when q = 2 then x' >\ x reduces to the usual x' > x, 
i.e., x\ > Xi for i G [&]. 

For < i < T, let be determined as follows. Let the mode of interaction be a collection 

of functions m v>t : [q]^ xNv ->■ [g], with 

= m Vi t(H Vtt ). 

These functions are generalization of the majority function used in majority dynamics. As such, 
we require that they meet the following conditions: 

1. They are fair, or symmetric with respect to the alternatives: for all permutations a on [q] 
and all histories h G [g][*] xAr «, <r(m v j(h)) = m v j(a(h)), where a(h) is the result of applying 
a to each element of h. 

2. They are monotone: for every pair h, h' G [gjM^", if m v j(h) = a and h' > a h then m v j(h') = 
a. 

An example would be majority dynamics, i.e., the case where q = 2, |JV„| is odd for all v, and m Vj t 
is equal to the most popular opinion among X w (t — 1), where w G N v . A different simple example 
is the case that X v (t) is simply equal to X v (t — 1), unless all of u's neighbors agree in time t—1 
on some alternative a, in which case X v {t) = a. That is, the agents do not change their opinions 
unless their friends unanimously agree on a different opinion. 

Following T rounds of interaction, we apply an election system function g : [q] v — > [q] to 
(Xi(T), . . . ,X|y|(T)), to determine the chosen alternative Y: 

Y = g(X 1 (T),...,X lvl (T)). 

This is again a generalization of a majority vote, and as such we require that g satisfy the same 
fairness and monotonicity properties: 

1. It is fair, or symmetric with respect to the alternatives: for all permutations a on [q], a{g{a)) = 
g(o-(a)), where <r(a) is the result of applying a to each element of a, a v . 

2. It is monotone: for every pair x,x' G [q] v , if g(x) = a and x' > a x then g(x') = a. 
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Examples of such functions are the simple plurality function and various recursive plurality (i.e., 
electoral college-like) functions. Another important example is the dictator function, in which 
g (a) = a v , for some fixed v. 

The whole process of social interaction and elections can be viewed as a single function from 
the original signals {X v (0)\v G V} to [q\. We denote this function by / : [q] v — > [q], and call it the 
aggregation function, so that 



Note that for brevity's sake we sometimes write the above as Y = f(X). It is easy to see that the 
aggregation function / has the same properties we that require from the election system g: it is 
monotone and fair. 

Finally, the concept of social types is, in the case of monotone dynamics, related to the sym- 
metries of the aggregation function, rather than those of the graph. We first define rl(f), the 
symmetry group of the aggregation function, as the group of permutations r on V that satisfy the 
following condition: for every a G [q] it holds that /(r(a)) = /(a), where r{a) v = a T i v \. 

It is easy to verify that M(f) is indeed a group, with composition as the operation: for any 
r,a G H(f) it holds that /(r(<r(a))) = /(r(a)) = /(a), and hence ra G H(f). Also, f(a) = 
/(r(r _1 (a))) = /(r-^a)), and so r" 1 is also in H(f). 

The set of Social types is simply V/H(f), the set of orbits of V under the action of H(f). I.e., 
A(f) is the unique partition of V such that v, w G V are of the same social type iff 3r G rl(f) such 
that t(v) = w. 

The definition of m(G,T) now naturally becomes the following. Given an aggregation function 
/ : [q] v — > [q], denote by m(f) the size of the smallest social type induced by /: 



Our main result of this section, which is a strict generalization of that of the previous, is again 
that information is aggregated efficiently provided that each social type has many members. In the 
case of monotone dynamics, our definition of the efficiency of aggregation is the following. Let Vs 
be the set of probability distributions P on [q] under which P(0) > P(i) + 6 for alH = 1, . . . , q — 1. 
Then the efficiency fi$(f) of a function / : [q] n — > [q] is defined by 



Our main result for this section is the following: 

Theorem 13. 1L Let f : [q] v —¥ [q] be a monotone and fair aggregation function, and let m = m(f) 
be the size of the smallest social type. Then 



for some C q that depends only on q. 

Theorem 13.11 is a statement about functions / such that m(f) is large. For q = 2 and odd n 
it is easy to find examples of such functions - the majority function, for example. However, not 
for every value of q, n and m < n there exists a fair and monotone aggregation function f q ^ n such 
that m = m(f q ^ n ). In particular, it is not clear for which values of q and n there exists a fair and 
monotone aggregation function that is transitive, i.e., m(f q ^ n ) = n. 



Y = g(X 1 (T), X ]vl (T)) = /(A^O), . . . , X ]vl (0)). 



m(f) = min 



AeA(f) 



W (/)= inf P[/(X) = 0]. 
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The challenge is to break ties in a way that preserves fairness and transitivity, and indeed 
it seems that no simple, immediate examples exist. We provide the following example of a fair, 
transitive and monotone function, for any q > 2 and n prime and larger than q. See further 
discussion in pQ. 

Proposition 2.2. For all q > 2 and n prime and strictly larger than q, there exists a monotone, 
fair and transitive aggregation function f : [q] n — > [q]. 



2.3 Unanimity results 

Here we consider any number of alternatives q, but specialize to the case where the mode of 
interaction is given by simple plurality. It is easy to construct examples showing that our earlier 
assumptions on the structure of the network do not imply that the whole electorate will eventually 
agree. Indeed, there could be a small clique of voters who are well-connected to each other but 
poorly connected to the rest of the population. These voters could forever maintain an opinion 
contrary to that of their peers. One way to avoid this situation is to ask that the social network 
be an expander graph. 

Let M be the adjacency matrix of a (i-regular graph G. We say that G is a A-expander graph 
if the second-largest absolute eigenvalue of M is no larger than A. 

Although we will not require any knowledge of expander graphs here, we refer the uninitiated 
reader to [H] for a survey on the topic. For now, it is enough to know that "good" expanders have 

A = o(Vd). 

Theorem 2.3. Let G n be a sequence of d-regular X-expanders whose size converges to infinity. 
JL 
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Suppose that 4 < A ant 



P(0) > P W + 2*™ (1) 

for all i ^ 0. For v a vertex in G n let X v (t) be drawn i.i.d. from P, and let the mode of interaction 
be majority dynamics. Then with probability converging to 1 as n — )• oo, there exists a time T such 
that X V (T) = for all v €V. 

The dependency on d in Eq. (Q]) is possibly not tight. In particular, if q = 2 and the girth of G n 
tends to infinity with n, then a result of Kanoria and Montanari [IT] implies that we can replace 
\[d~ by d a for any a > 0. 



2.4 Higher threshold results 

For q = 2, consider the election system g a (x) = l(Yli x i > (1 — a)n). When a = 1/2, this is just 
the simple majority function. It is monotone and symmetric and so Theorem 13.11 applies. When 
a > 1/2, however, g a is no longer symmetric in the alternatives. We prove that the final bias is as 
large as the expected initial bias. 

Theorem 2.4. Let f n>a be a fair and monotone aggregation function with election system g n>a on 
the graph G n after running T rounds of interaction. If m(/ rijQ ) — > oo and a < \ + | then for any 
T € N, 

lim Hs{f n ,a) = 1- 

n— >oo 

We do not believe that the relationship between a and 5 is the best possible. Note that for the 
complete graph on n nodes, one can take a exponentially close to 1 for any 5. It is natural to guess 
that the worst dependence on n occurs in a ring. For this case we show that one can take a as 
large as 1 - (1 - 5) 2 /2. 
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3 Aggregation of Information 



In this section, we will prove the following theorem, using the definitions of Section 

Theorem 3.1. Let f : [q] v —> [q] be a monotone and fair aggregation function, and let m = m(f) 
be the size of the smallest social type. Then 



w(/ ) £1 -C,ex P (- Cji ^) 



for some C q that depends only on q. 

The proof of this theorem relies on a "sharp threshold" theorem of Kalai and Mossel |16j (which 
is itself an extension of Talagrand's theorem [23] to the case q > 2). Sharp threshold theorems 
go back to Margulis [19] and Russo [22] ; Friedgut-Kalai [8] and Kalai [H] apply sharp threshold 
theorems in contexts similar to this one. In fact, the result of |14j gives a weaker version of 
Theorem 13.11 in which each social type must have at least n/o(logn) members. 

A crucial ingredient for sharp threshold results is the notion of influence, which we will define 
for a function / : [q] n — > {0, 1}. Let P be a probability measure on [q], and denote also by P the 
corresponding product distribution over [q] n . The influence of voter i on a function / : [q] n — > {0, 1} 
is 

P P (f) = E P Varp(/(Xi, . . .,X n )\X 1} . . .,X^ u X i+u . . .,X n ). (2) 
Kalai and Mossel [16] prove the following inequality: 
Theorem 3.2. Suppose that P(a) > a > for every a G [q\. If maxj ij (i) < e then 

for a universal constant C . 

Before proving Theorem l3.1l we will require a simple definition and Lemma. Let P be a probabil- 
ity distribution on [q] such that P(0) > 0. Define the following family of distributions P< (indexed 
by t G [0, 1]) as follows: 



Pt(o) 



t a = 

(l-t)P(o|o^0) a>0. 



Note that Pp(o) 



Lemma 3.3. Let P be a probability distribution on [q] such that P(0) = P(l) + S for some 5 > 0. 
Let s be such that P s (0) = P s (l). Then 

P(fl)-8>6/2. (3) 
Proof. We can solve for s to find that s = (P(0) — 5)/(l — 5). Hence 

Where the inequality follows from the fact that since P(0) = P(l) + 5 < 1 — P(0) + (5, it holds that 
l-5<2-2P(0). □ 
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We prove Theorem 13.11 below by calculating the derivative of Pt(/ = 0) with respect to t and 
then integrating between t = s and t = P(0). We thus interpolate between P s , in which the 
probability of and a are equal, and P (= Pp( )), in which the probability of is larger by 6 than 
the probability of a. 

For a function g and a probability measure P, we will write P(g) for the expectation of g under 

P. 

Proof of Theorem \3.1\ Since the conclusion of the theorem is only weakened when 6 is reduced, 
we can assume without loss of generality that the inequality P(0) > P(i) + 6 is tight and that 
P(l) + 6 = P(0). Choose s G [0,P(0)] so that P s (0) = P s (l). 

Define g = l(/=o)- Suppose (for now) that P(6) > ^ for all b G [q], and so P*(6) > ^ for all 
s < t < P(0) and all b G [q\. Since / is fair and monotone, P s (g) > 1/q. Using monotonicity again, 
Pt(s) > l/q for all t > s. 

By Theorem 13.21 if e t := max ig r n i Ip t (g) < 1/10 for all i then 

^ ciogOAi) ciogOAO 

yi Pt (g) > - — , , , Var Pf (g) > — — —— P t (l -g) 

^ log(2g/<5) glog(2g/5) 



for all t G [s, P(0)]. Now, recall that for ^4 G A, if i, j G A then they play the same role in / and 
in particular have the same influence. Hence X^«=i -^p (o) — me t, since \ A\ > m for any A G A. In 
particular, if > (log m)/m then ^i-^Ptlfl 1 ) — logm; on the other hand, if et < (log m)/m then 
the display above implies that 

^ /p ' (! " £C «i^ p ' (1 - 9) ' (4) 

for some C q that depends only on q. This last inequality (Eq. H|) holds, therefore, in either case. 
On the other hand, Lemma 2.3 of [16] (a generalization of Russo's formula) gives 



at 



i=i 

and so 

for all t G [s,P(0)]. Integrating between s and t, we have 

P t (g)>l-lexp(-C q -^-(t-, 

q V log(l/(5) 

and so we conclude by setting t = P(0) and invoking Eq. ([3]). 

Now, if the hypothesis P(b) > ^ fails then we construct P by P(0) = P(0) - 6/2 and P(6) = 

P(6) + 2(q-i) ^ or b ^ 0. Setting 6 = 6/2, we see that P satisfies the hypothesis of the theorem (with 

6 replaced by 6) and it also satisfies P(b) > The proof goes through, then, and we can absorb 
the extra factor of 2 into the constant C q . □ 
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3.1 Where aggregation fails 

Let q = 2 and suppose that both the interaction mode and the election system are given by simple 
majority votes. In this scenario, we prove Theorem 12. II by giving an example with two social types, 
one of which has a constant size as n — > oo. Information will not aggregate asymptotically in this 
example, and the reason for the failure will be the presence of the constant-sized social type. 

Since q = 2, it will be more convenient to set p = 5 + 5 = P(0) and to write our example in terms 
of p instead of in terms of 5. Let G n = (AuB,E), where \A\ = 1/(1— p) and \B\ = n(l/(l — p) + 1). 
Then in particular the number of vertices in G n is at least n. We assume here that 1/(1 — p) is an 
integer. 

Let each a £ A be connected to each b 6 B, and let none of the vertices in A be connected to 
each other. The vertices in B are arranged in n cliques, each of size 1/(1 — p) + 1, and there are no 
edges between the cliques. Each vertex in B has a self-loop. 

The degree of the vertices in B is odd, since each has edges to 2/(1 — p) + 1 edges. To make 
the degrees in A odd add a vertex that is connected to all vertices in A. An isolated vertex can be 
added to make the total number of vertices odd. 

Henceforth we condition on the event that X v (0) = 1 for all v S A. Note that this happens 
with probability (1 = (1 - p)V(i-p) . 

Let C be one of the cliques of B. If at least one vertex uu in C votes 1 initially (at time t = 0) 
then all the vertices in C will vote 1 in the next round (t = 1); each will have at least 1/(1 —p) + 1 
neighbors ({w} U A) that vote 1 and at most 1/(1 — p) neighbors (B \ {w}) that vote 0. The 
probability that at least one vertex in C votes 1 initially is 1 — p 1 /( 1 ~ p \ which is greater than 
1 — 1/e, or about 0.63. Hence the number of cliques in which all vertices will vote 1 at time 1 will 
be distributed Binom (n, 1 — p 1 ^ 1- ^), which dominates the distribution Binom (n,0.6). 

By Hoeff ding's inequality, the probability that a majority of the cliques (and hence a majority 
of the vertices) will vote 1 at time 1 is at least 1 — exp(— 0.02n). Once this happens, the vertices in 
A will all vote 1 in all future iterations, and so will these cliques. Hence for all T > 2 a majority 
vote will result in 1. 

The event that a majority of the cliques have a voter that initially votes 1 is independent of 
the event that all vertices in A initially vote 1. Hence both events happen with probability at least 
(1 -p) 1 /(i-p)(l- eX p(-0.02)). Since this quantity is positive and independent of n, it follows that 
information does not aggregate and Theorem 12. II is proved. 

Berger [3] constructs an example of a family of graphs with n vertices. In each graph there exists 
a set of at most 18 vertices (which he calls a dynamic monopoly), such that if all agents in this set 
initially vote identically then, in majority dynamics with two alternatives, all the agents converge 
to the initial vote of the dynamic monopoly. In particular, this implies that in this example, with 
probability at least (1— p) 18 , aggregation fails for any n. This is another example of how aggregation 
can fail when a particular social type has a small size (in this case at most 18). 

4 The existence of monotone, fair and transitive aggregation func- 
tions 

Proposition 4.1. For all q > 2 and n prime and strictly larger than q, there exists a monotone, 
fair and transitive aggregation function f : [q] n — > [q]. 

Proof. Let q > 2 and let n > q be prime. Let / : [q] n — > [q] be defined as follows. 

For a = (ao, . . . , a n -i) £ [o\ n let Q(a) be the set of alternatives that received the most votes. If 
Q(a) = {b} is a singleton then let f(a) = b. Otherwise |Q(a)| > 2. Let M(a) C [n] be the set of 
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voters that voted for one of the alternatives in Q(a). Note that |M(a)| / n, since otherwise each 
alternative received the same number of votes and so |Q(a)| divides n, which is impossible since 
n is prime. Also, M(a) is clearly not the empty set, and so |M(a)| is an invertible element of the 



where addition and division are taken over the field 7L n . Note that k{a) is the "average" position 
of a voter that voted for one of the votes that received the most votes. Let 



By definition f(a) G Q(a), and so / is the plurality function with some tie breaking rule, and is 
therefore monotone. Also, none of the alternative names appear in its definition, and it is therefore 
fair. It remains to show that it is transitive. We do this by showing that for each < i\ < ii < n 
there exists a permutation r = Ti 1: i 2 on [n] such that t{i\) = ii and /(r(a)) = f(a), where 
r(a) = (a r ( ), . . . ,a T („_i)). 



Let Ti lt i 2 (i) = t(i) = i — i\ + %2 mod n. Note that Q(r(a)) = Q(a) and that M(r(a)) = 
T- l (M(a)j, so that \M(r(a))\ = \M(a)\. Hence 



field Z n . Let 




t(a) = min{0 < i < n : k(a) + i G M(a)}, 



where again the sum k(a) + i is taken over Z n . Finally, define 



f(a) = a fe ( a ) + £ (a ). 




By a change of variables we get that 





Next, 



£(r(a)) 



min{0 <i<n: k(r(a)) + i G M(r(a))} 

min{0 < i < n : k(a) + i\ — ii + i G M(a) + i\ — 12} 

i{a), 



and finally, since r(i + j) 



r{i)+j: 



f(r(a)) = a T 



(k(r(a))+£(r(a))) °r (fc(r(o))) +i{a) 



a k{a)+£(a) - f( a )- 



□ 
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5 On higher thresholds of agreement 

In this section we again specialize to the case of q = 2 alternatives, and consider the question 
of when it can be shown that, after a number of rounds of fair and monotone dynamics, a large 
proportion of the population will agree on the correct alternative. 

Consider the election system g a (x) = l(Yli x i — (1 ~~ a)n). When a = 1/2, this is simply 
the majority function, and so our earlier results apply, and under the appropriate conditions Y = 
g(Xi(T), . . . , JT|y|(T)) will equal with high probability. What about when a > 1/2? In this case 
Y will equal only if an a fraction of the population votes at time T. When does this happen 
with high probability? 

Since g a satisfies the same transitivity properties as 51/2, the proof of Theorem 13. II mostly still 
applies. At least, the "sharp threshold" part of the claim is still true: there is some p* £ (0, 1) such 
that P(0) > p* implies that P(Y = 0) — >w/ n ) 1- Since g a is no longer anti-symmetric, however, 
we no longer know that the threshold occurs at p* = 1/2. 

In this section, we will show that p* < a, but we will also give a simple example for which 
p* = 1 — 0((1 — a) 2 ) as ol — y 1. Thus, there may be a large gap between our bound and the true 
behavior of p* . 

The first step is to obtain a lower bound on E]T\X„(i). The argument here appeared in a 
course taught by the first author in Fall 2010, although it may have been known before then. In 
any case, we give a proof for completeness. For the rest of this section, P p denotes the probability 
distribution on {0, 1} satisfying P p (0) = p, in which case S = 2p — 1. As above, we also denote by 
P p the distribution over n i.i.d. random variables distributed P p . 

Lemma 5.1. Let f : {0, l} n — > {0,1} be a monotone function with Pi/2(/ = 0) > |. Then 
P p (f = 0)>pforallpe[±,l]. 

Note that equality holds for the function f(x) = X{. In other words, every monotone function 
aggregates information at least as well as a dictator function. It is easy to construct less pathological 
examples that come arbitrarily close to achieving this bound. 

Proof. By the chain rule, 

n 

Pp(f(Xi, . . . , Aj_i, 0, X i+ i, . . . , X n ) — f(Xi, . . . , X{-i, 1, X i+ i, . . . , X n )) 

i=l 

1 - 

By the Efron-Stein inequality, ^2l l (f) > Var(/), with equality only if / depends just on one 
coordinate. If / depends just on one coordinate, then the proof is trivial, so we can suppose the 
contrary. Thus |P p (/) < --^Varp^/). 

Suppose, for a contradiction, that 1 — P p (/) = P p (/ = 0) < p for some p > 4. Let r be the 
infinum over all p satisfying the previous sentence. Since P p (/) is a smooth function of p, it follows 
that P r (/) = 1 — r and so Varp r (/) = r(l — r). Thus, J^P p (/)| p=r < —1, contradicting the 
assumption that Pp(/) > 1 — p for arbitrarily close p > r. □ 

Note that for any vertex v and any t, the conditions of the lemma hold for / = X v (t). Summing 
over all v, we obtain the following: 



dp 
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Corollary 5.2. Suppose that X„(0) are independent Bernoulli variables with mean p > \. Then, 
for any t, 

nJ2Xv(t)< (i-p)\v\. 

vev 

Combining this with the proof of Theorem 13 .1\ we arrive at the promised bound on the location 
of the sharp threshold. Of course, this is just a restatement of Theorem 12.41 



Corollary 5.3. Let f n : [q] n — )• q be a sequence of aggregation functions with monotone and fair 
modes of interaction and election system g a as defined above. Suppose that liniyi^oo m(/ n ) = oo ; 
and that p > a. Then P p (Y = 0) — > 1 as n — )• oo. 

Proof. For the sake of brevity, denote g e = g e (X\(T), . . . ,X\y\(T)) (which equals Y for e = a). 
From the proof of Theorem we have 

dP P (g a = 0) 

> C(logm)Var Pp (5( Q ). 

On the other hand, Corollary 15.21 gives us that for any e > 

P p (g p . e = 0) = P p (J2x v (t)<(l-p + e)\V\)>e 

and so Var Pp (g p ^ e ) > eP p {g p - e ) for any e. 

Fix q < p and set e = (p — a)/2. Then for any r G [a + e,p], Varp r ((7 a ) > eP r (g a ) and so we 
can solve the differential inequality 

dP r (g a = 0) 

— > Ce(logm)P r (5- a ) 

in the range [p — e,p], with initial condition P p _ e (g a = 0) > e. We obtain 

P P (Y = 0) = P p (g a = 0) > 1 - (1 - e) exp (-Ce 2 log m)) 
and we send m — > oo. □ 



5.1 An example: cycles 

Let G n be a cycle on n vertices, where each vertex has a self-loop, and recall that p = ijjp. 
When the mode of interaction is majority dynamics, we can explicitly calculate the distribution 
of lim^oo X v (t). This will yield a wider bound (compared to Theorem I2.4[) on the range of a for 
which lim n _ >00 ns(f n ,a) 1- Of particular interest are the cases when S — )■ or 5 — > 1; for small 5, 
a < ^ + |<5 — J7(5 3 ) turns out to imply Imin^^ [is(fn,a) — > 1, while for large <5, if we set e = 1 — 5, 
then a < 1 — ^e 2 is sufficient. Therefore, the bound in Theorem 12.41 is not tight: for 5 close to zero, 
one can take a ~ | + |<5 while Theorem 12.41 only guarantees that a = 5 + ^<5 will work; for 5 close 
to 1, a « 1 - \e 2 is sufficient, but Theorem 12.41 only gives a = 1 — ie. 

The analysis of the cycle is relatively simple because the eventual state of the voters can be easily 
foretold from the initial state. First of all, whenever two (or more) adjacent voters share the same 
opinion, they will retain that opinion forever. Moreover, strings of voters whose opinions alternate 
will gradually turn into strings of voters with the same opinion, as in the following example: 

time t •••11010100--- 
time t+l---11101000--- 
time t + 2---11110000--- 
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In fact, one can tell the eventual opinion of a voter v with the following simple rule: let V > be 
the smallest number such that X v _y = X v -y-\ an d let W > be the smallest number such that 
X v+ \y = X v+ w + i (assuming that such V and W exist, which will only fail to happen in the unlikely 
event that the whole cycle consists of alternating opinions). If V < W then X v (t) = X v -y(0) for 
all t>V. On the other hand, if W < V then X v (t) = X v+W (0) for all t > W. (If V = W then 
X v -v(0) = X v+ w(0) because X v -y(0) = X v (0) if and only if V is even, and similarly for W .) 

Proposition 5.4. For any v, 

2p 2 -p 3 1 5r5 — J 3 4e 2 - e 3 



lim lim P(XJt) = for all t > T) = — ^ = - + - , 

r^oon^oo K VK ' J ~ ' i-p + pt 2 6 + 25 2 8-4e + 2e 2 

As we observed following Corollary 15.21 this implies that if a < | + and the number of 

interaction rounds is sufficiently large (depending on a and p), then fJ,s(f n ,a) ~~ ^ 1- 

Proof. For brevity, we will write X v instead of X v (0) for the initial state of vertex v. Instead of 
majority dynamics on the cycle, consider majority dynamics on Z; we will see later that these are 
essentially the same when n is large. We may assume without loss of generality that v = 0. As in 
the discussion above, let V > be minimal such that X_y = X-y^i and let W > be minimal 
such that X\y = Xyy + i. 

Let us first condition on Xq(0) = 0. Consider the i.i.d. sequence 

Yfc = (X^2k, Xi^2k, X 2 k-1, X 2 k) S {0, l} 4 . 

If Yi, . . . , Yj = (0, 1, 1,0) then the sequence X_2j, ■ ■ ■ , X 2 j consists of alternating zeros and ones, 
and so V, W > 2j. Define Aq,Ax C {0, l} 4 by 

A = {(a, b, c, d) : b = or c = 0} 

A\ = {(a, b, c, d) :a = b = c= lorb = c = d=l} 

Note that A n A x = and {0, l} 4 \ (A U A x ) = {(0, 1, 1, 0)}. Therefore, if J is minimal such that 
Yj / (0, 1, 1, 0) then Yj is in either A or A x . If Yj G A then either = 2 J - 2 and X w = 0: 

Xq X\ X 2 X% X4 • • • X 2 j-3 X2J—2 X2J-1 
1 1 ••• 1 

or V = 2 J - 2 and X_y = 0: 

-X-(2J-1) -^-(2.7-2) X_( 2 J-3) '•• -^-4 -^-3 -X-2 X-i X 

1 ••• 1 1 

In either of these cases, Xo(t) = for all t > 2 J — 2. Conversely, if Yj G ^i then either AT^/ = 1 
or Xy = 1 and Xq{€) = 1 for all t > 2J — 1. Thus, (using the fact that J and Yj are independent) 

P(X (t) = for all t > T \X = 0) = P(Yj G A )P(2J — 2 < T). (5) 

Since the Yj are i.i.d, 

Pry, f i) = p ( y i ! A>) = 2p-p 2 2p-p 2 

^ jt p(y lG ^ oU A 1 ) 2jj-p 2 + 2(l-p) 3 -(l-p) 4 1-jj 2 + 2jj 3 -p 4 ' {) 
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where we have computed P(Yi £ Ai) by the inclusion/exclusion formulas 
P(Yi G A ) = P(X_i = 0) + P(Xi = 0) - P(X_i = X 1 = 0) 

P(Fi G A x ) = P(X_ 2 = X_i = Xx = 1) + P(X_i = Xi = X 2 = 1) - P(X_ 2 = • • • = X 2 = 1). 

The case for Xq = 1 is similar: we define 

A = { (a, 6, c, d) : a = b = c = or b = c = d = 0} 
A'x = {(a,b,c,d) : b = 1 or c = 1}. 

If J' is minimal such that Yj> ^ (1,0,0, 1) then Y> G ^4' implies X (t) = for t > 2 J' - 1, while 
Y r G A[ implies X (i) ->■ 1 for t > 2J' - 2. Since P(Yi G A' ) = 2p 3 - p 4 and P(Yx G A'J = 
2(1 — p) — (1 — p) 2 , we have 

P(X (t) = for all t > T\X = 1) _ P(Yx G ^) = 2p 3 - p 4 

P(2 J — 1 < T) P(Yi G A' n Ai) 1 - p 2 + 2p 3 - p A ' U 

To transition back from dynamics on Z to dynamics on the n-cycle, note that the event {Xq{€) = 
for all t > T} is the same event on Z and on the n-cycle, provided that n > 2T. In particular, ([5]) 
and ([6]) imply that 

lim P(X (t) = for alU > T \X = 0) = P(2J - 2 < T)- ^ P ~ P q T 

n— s>oo 1 — p z -|- 2p — p 

for majority dynamics on the n-cycle (and similarly conditioned on Xq = 1, using ([7]). Since 
lim r _ >00 P(2J-2 < T) = 1, 

lim lim P(X (t) = for alii > T \X = 0) 2/ ' ~ 



T-^oo n-5-oo 1 — p 2 + 2p 3 — p 4 

(and similarly conditioned on Xq = 1). Finally, 
P(Xo(t) = for all t > T) 

= pP(X (t) = for all i > T\X = 0) + (1 - p)P(X (i) = for all t > T\X = 1) 



2p 2 — p 3 



1 — p + p 2 

as T, n — > oo. The formulas in terms of S and e are obtained by substituting p = ^4^- = 1—1. □ 



6 Expander graphs converge to unanimity 

6.1 Majority dynamics with two alternatives 

In this section we again consider the case that q = 2 and majority dynamics (i.e., each voter adopts 
the majority opinion of its neighbors), with a population wide majority vote at time T. To avoid 
the issue of ties, we assume that \N V \ is odd for all v and that n is odd. 

Let G be a graph and M its adjacency matrix, so that M vu is 1 if (u, v) G E and otherwise. 
We say that G is a A-expander if the second-largest absolute eigenvalue of M is at most A (cf . [11] ) . 
Expander graphs have particularly nice properties under the iterated majority dynamics. One 
reason for this is that in an expander graph, the number of edges between disjoint sets A and B of 
vertices is almost completely determined by the cardinalities of A and B. We state this formally 
in Lemma 16.11 below. 
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Denote E(A,B) = l^Mlg, where A and B be sets of vertices. Note that if A and B are 
disjoint then E(A,B) is the number of edges between A and B, and if ^4 and B are not disjoint, 
then E(A,B) double-counts edges from Af]B to itself). Alternatively, E(A,B) is the number of 
"edge-ends" of edges with one end in A and another in B. 

Recall that a graph (i-regular if all vertices have degree d, i.e., |iV v | = d for all v £V. 

Lemma 6.1 (Expander mixing lemma (cf. [2j)). If G is a d-regular X-expander with n vertices 
then 

\A\\B\d 



E (A,B)- ] -^^- <A^4p| 
n 



for every A, B C G. 



It follows easily from the expander mixing lemma that medium-sized majorities are unstable 
under iterated majority dynamics. That is, if a reasonable majority of people prefer one outcome 
then very quickly a large majority of people will prefer that outcome. 

Proposition 6.2. Let q = 2, let n be odd, let G be a d-regular X-expander with d odd, and let the 
mode of interaction be majority dynamics with a majority vote at time T. 

Let No(t) be the number of agents that vote at time t and let Ni(t) be the number of agents 
that vote 1. If N (t) > N^t) + an then Ni(t + 1) < J^n. 

Proof. Let Ao(t) be the set of agents that vote at time t, and define A±(t) similarly. Then, by 
the nature of majority dynamics, every v 6 A%{t + 1) has more than half of its neighbors in A\(t). 
Summing over every v € Ai(t + 1), we have E{A\(t+ 1), A\(t)) > E(A\{t + 1), A (t)). By applying 
the expander mixing lemma to both sides, 



m(t + l)N (t)d 



xVN.it + Tjw) < Nl{t + 1)Nl{t)d + xVm + TjNfij- 



n n 
Rearranging, and since No(t) — N\(t) > an, 

X 



ay/N^t + 1) < -(v^ViCO + VN (t)) < -V2n. 



□ 



Applying the proposition twice, we see that an imbalance of implies that a large, stable 
majority will form within one time-step. 

Corollary 6.3. If N (t) > Ni(t) + ^ and J < ^ then Ni(s) < | for alls >t+ 1. 

Proof. Taking a = ^ in Proposition we have < f . Then N (t+1) > iVi(t+l) + ^ > 

and so we can continue applying Proposition 16.21 indefinitely with a = ^ . □ 

In order to show that a complete consensus is eventually achieved, we will use a result of [9], 
who proved that majority dynamics will eventually enter a cycle with period at most two. 

Proposition 6.4. If ^ < and No(t) — Ni(t) > for some t, then majority dynamics converge 
to all 0. 
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Proof. Since majority dynamics converge to a cycle with period at most two, we can divide the 
vertices of G into four sets: ^4oo is the set of nodes that converge to 0, An is the set that converge 
to 1, with Aqi and A\q being the two sets of nodes that eventually alternate between and 1. By 
Corollary E31 \Au\ + max{|A i|, \Ai Q \} < §, and so \A% \ = \A U \ + \A 01 \ + |4lo| < j- % the 
expander mixing lemma, 

\E(Al ,Al )\ < |^o| 2 ^ + A|^ol < I^Sol (l + A ) • 

On the other hand, \E(A 00 , A c 00 )\ + \E(A C 0Q , A c 00 )\ = d\A c J and so \E(A 00 , A c 00 )\ > |^ |(f -A). 
Since A < d/4, \E(Aqq, Aq )\ > ^\Aq \. Supposing that Aq is non-empty, there must be at least 
one vertex v G Aq with more than half of its neighbors in ^oo- But then the definition of majority 
dynamics would imply that v converges to 0, a contradiction. Thus Aq must be empty, and all 
agents converge to 0. □ 

In particular, a random d-regular graph has A = O(Vd) with high probability. Therefore, if we 
start with an initial bias such that P(0) — \ > dr x l 2 then iterated majority on a random <i-regular 
graph will converge to all with high probability. 



6.2 Plurality dynamics on expanders 

The results of the previous section can be extended with little effort to the case of more than 
two alternatives. The main obstacle in making this extension is specifying the resolution of ties. 
With two alternatives, we avoid the possibility of ties in majority dynamics simply by requiring 
each vertex to have odd degree. With more than two alternatives, the simplest way to avoid ties 
is to perturb the edge weights slightly so that they are rationally independent. Our expansion 
assumptions can be easily extended to the weighted case: let M be the weighted adjacency matrix 
of G and assume that all of its entries on or above the main diagonal are rationally independent of 
one another. Let d be the largest absolute eigenvalue of M and let A be the second-largest. Note 
that if M was constructed by perturbing the edge weights of a random regular graph, then d will 
be approximately the degree of the graph and A will be 0(^/d). 

With the assumptions above, Lemma 16. 1 1 holds exactly as it was stated above, and so the proof 
of Proposition 16.21 applies also. 

Proposition 6.5. For a € [q], let N a (t) be the number of people that vote a at time t. If N a (t) > 
i±^n then N a (t + 1) > n(l - ^) 

To get an extension of Proposition 16.41 we first need to extend the periodicity result [9] to the 
case of several alternatives. This extension uses exactly the same argument as [9], but we include 
it for completeness. 

Proposition 6.6. On a weighted graph with no ties, iterated plurality dynamics converge to a cycle 
of length at most 2. 

Proof. Consider the quantity 

Jv{t) = I 0-{X v (t+l)=a} ~ 1 {X v (t-l)=a}) ^2 e w" l {X w (t)=a} J , 
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where e wv is the weight of the edge between v and w. Note that J v (t) > with equality if, and 
only if, X v (t + 1) = X v (t — 1). Indeed, if X v (t + 1) = X v (t — 1) then J v (t) = trivially, so suppose 
that X v (t + 1) = a and X v (t — 1) = b ^ a. Then 

{w^v:X w (t)=a} {w^v:X w (t)=b} 

Since X v (t + 1) = a and the edge weights are chosen to ensure that ties never happen, this implies 
that J v (t) > 0. 

Now consider J(t) = J v (t). Note that if we define 

L W = £ ^2 e ^l{X v (t+l)=a} 1 {X w (t)=a} 

then J{t) = L(t) — L(t — 1). Since the state space of the dynamics is finite and the dynamics are 
deterministic, the process eventually (by time T, say) converges to a cycle (of period k, say). Then 

T+k T+k T+k-1 

At) = £ L(t) - £ L(t) = 0, 

t=T+l t=T+l t=T 

since the states are identical at time T and T + k, and thus L(T) = L(T + k). Since J(t) > for 
every t, it follows that J(T + 1) = 0. Then J V {T + 1) = for every v and so the state at time T + 2 
is identical to the state at time T. □ 

With Proposition 16.61 in hand, the rest of the proof of Proposition 16.41 goes through in the 
(/-alternative case. We only note that we need to replace Aqi by the set A a * = {v : X v (2t) = a / 
X v (2t + 1) for large enough t}. 

Proposition 6.7. If ^ < ^ and N a (t) > n{\ + ?f) for some t then the plurality dynamics converge 
to a. 

In particular, if we take a random (i-regular graph and perturb each edge weight by at most 
n , then the second eigenvalue will hardly change, so we will still have A = 0(Vd). If P(X v (0) = 
a) > P{X v {0) = b) + for every b ^ a then at time t = 1, with high probability most of the 

vertices will prefer a and Proposition 16.71 will imply that the plurality dynamics will converge to 
all a. 

6.3 A stronger result for expanders with large girth 

In Section [6.11 we proved that in majority dynamics with two alternatives, an initial bias of d" 1 ^ 2 
is sufficient (on a random d-regular graph) for consensus in the limit. Kanoria and Montanari |17| 
showed that on an infinite d-regular tree, the required bias is much smaller as a function of d: 

Theorem 6.8 (Kanoria and Montanari). Let v be a vertex in an infinite d-regular tree. For any 
(3 > and all sufficiently large d, i/P(0) > ^ + dr$ then with probability one, X v (t) = for all 
sufficiently large t. 

Using this, it is easy to improve our earlier bias requirement for consensus from P(0) — \ > dr 1 ! 2 
to P(0) - \ > d~ p for any /3 > 0: 
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Corollary 6.9. For every d, let G H) d be a sequence of d-regular X-expanders with ^ < jq, such that 
the girth of G n ^ tends to infinity with n. For any (3 > 0, if p > ^ + dr^ then for all sufficiently 
large d, with high probability (as n —> oo) the iterated majority process on G n d will converge to all 
0. 

Proof. Choose d large enough (depending on j3) so that Theorem 16.81 applies, then choose T large 
enough so that P(X V (T) = 0) > ^ + -^= on the d-regular tree, for some constant C to be determined. 
By choosing n large enough, we can ensure that the girth of G n ^ is larger than T; thus ~P(X V (T) = 
0) > ^ + ^= for every v G G n ^- Then the expected fraction of nodes that are by time T is at 

least i + since at time T each node only depends on the initial values of nodes within a ball 
of radius T. Since the number of such nodes is bounded as n — > oo, McDiarmid's inequality |20j 
implies that with high probability, at least \ + fraction of nodes are at time T . If we choose 
C large enough, Proposition 16.41 implies that the dynamics converge to all 0. □ 
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